Non-value added data activity reduction for business continuum, involves
aligning business processes to data structure, after eliminating 
inaccuracy, inconsistency, duplication and unnecessary gap in it 

Patent Assignee: BIALEK G C (BIAL-I); COOTE P (COOT-I); KRAMER J F (KRAM-I) 

Inventor: BIALEK G C; COOTE P; KRAMER J F
Patent No Kind Lan Pg Main IPC Filing Notes
US 20020138484 A1 11 G06F-017/60

Abstract (Basic): US 20020138484 A1

NOVELTY - The business processes and corresponding data 
requirements are aligned to the data sets of a data structure, based on 
the attributes defined for each data set after eliminating unnecessary 
gaps, duplications, inconsistencies and inaccuracies among the data 
sets. The alignment is updated in response to the changes in business 
processes, corresponding data requirements and attributes. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for data 
structure comprising data sets. 

USE - In database management application for reducing non-value 
added data activity such as redundant entry of data, unnecessary data 
transformation, unnecessary data reconciliation and resolution of 
missing, inaccurate or inconsistent data across business continuum. 

ADVANTAGE - The efficiency and profitability of the overall 
business enterprise, are increased by minimizing non-value added data 
activity across the business continuum. 

DESCRIPTION OF DRAWING(S) - The figure shows the flow diagram
explaining continuum. 
Set   Items   Description

S1    10555   DATABASE? OR DATABANK? OR DATA()(BASE? OR BANK?) OR DB OR
              RDB OR DBMS OR OODB OR ORACLE OR SYBASE OR SQL OR
              DATA()(BASE? OR BANK?)

S2    869     S1(4N)(MULTIPLE OR MULTIPLICITY OR PLURAL OR PLURALITY OR
              MIRROR? OR MANY OR SEVERAL OR DIFFERENT OR VARIOUS OR VARIETY)

S3    9164    COMBINE? OR COMBINING OR CONSOLIDAT? OR DISTRIBUTED OR
              SHARED OR NETWORKED OR GROUPWARE?

S5    601     GLOSSAR? OR DICTIONAR? OR LESICOG? OR LEXICON? OR THESAURI?
              OR WORDLIST? OR WORD()(LIST OR LISTS)
check

Author(s): Easterbrook, S.; Finkelstein, A.; Kramer, J. ; Nuseibeh, B. 
Author Affiliation: Sch. of Cognitive & Comput. Sci., Sussex Univ., 
Brighton, UK 

Journal: Concurrent Engineering: Research and Applications vol. 2, no. 3
p. 209-22

Publication Date: Sept. 1994 Country of Publication: UK 
CODEN: CRAPEM ISSN: 1063-293X 

U.S. Copyright Clearance Center Code: 1063-293X/94/030209+14$08.00/0
Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: Support for concurrent engineering must address the "multiple
perspectives problem": many actors, many representation schemes, diverse
domain knowledge, and differing development strategies, all in the context
of distributed asynchronous development. Central to this problem is the
issue of managing consistency between the various elements of an emerging
design. In this paper we argue that striving to maintain complete
consistency at all points in the development process is unnecessary, and
an approach based on tolerance and management of inconsistency can be
adopted instead. We present a scenario which highlights a number of 
important issues raised by this approach, and we describe how these issues 
are addressed in our framework of distributed Viewpoints. The approach 
allows an engineering team to develop independent Viewpoints, and to 
establish relationships between them incrementally. The framework provides 
mechanisms for expressing consistency relationships, checking that 
individual relationships hold, and resolving inconsistencies if necessary. 
(21 Refs) 

Subfile: C 

Descriptors: concurrent engineering; groupware 

Identifiers: Viewpoints; consistency check; concurrent engineering; 
multiple perspectives problem; diverse domain knowledge; development 
strategies; distributed asynchronous development; inconsistency resolution 

Class Codes: C7480 (Production engineering computing); C3355 (Control 
applications in manufacturing processes) ; C6130G (Groupware) 

Copyright 1995, IEE
WPI Acc No: 2003-017987/200301

XRPX Acc No: N03-013862

Non-value added data activity reduction for business continuum, involves
aligning business processes to data structure, after eliminating 
inaccuracy, inconsistency, duplication and unnecessary gap in it 

Patent Assignee: BIALEK G C (BIAL-I); COOTE P (COOT-I); KRAMER J F (KRAM-I) 

Inventor: BIALEK G C ; COOTE P ; KRAMER J F 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20020138484 A1 20020926 US 2001681361 A 20010326 200301 B

Priority Applications (No Type Date): US 2001681361 A 20010326
Patent Details:

Patent No Kind Lan Pg Main IPC Filing Notes
US 20020138484 A1 11 G06F-017/60

Abstract (Basic): US 20020138484 A1

NOVELTY - The business processes and corresponding data 
requirements are aligned to the data sets of a data structure, based on 
the attributes defined for each data set after eliminating unnecessary 
gaps, duplications, inconsistencies and inaccuracies among the data 
sets. The alignment is updated in response to the changes in business 
processes, corresponding data requirements and attributes. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for data 
structure comprising data sets. 

USE - In database management application for reducing non-value 
added data activity such as redundant entry of data, unnecessary data 
transformation, unnecessary data reconciliation and resolution of 
missing, inaccurate or inconsistent data across business continuum. 

ADVANTAGE - The efficiency and profitability of the overall 
business enterprise, are increased by minimizing non-value added data 
activity across the business continuum. 

DESCRIPTION OF DRAWING(S) - The figure shows the flow diagram
explaining continuum. 

pp; 11 DwgNo 3/10 
ALIGN; BUSINESS; PROCESS; DATA; STRUCTURE; AFTER; ELIMINATE; INACCURACIES
International Patent Class (Main): G06F-017/60

International Patent Class (Additional): G06F-007/00; G06F-017/30
Company financial data analyzing and comparing method, involves applying
three subroutines to data if they are applicable, and electronically 
reporting results of subroutines to identify factors causing changes in 
revenue and cost 

Patent Assignee: FORD MOTOR CO (FORD)

Inventor: COOTE P ; LOUGOVIER S A; PRZYBOCKI P; TOMILO M; YUN D 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6856972 B1 20050215 US 2000635827 A 20000810 200518 B

Priority Applications (No Type Date): US 2000635827 A 20000810
Patent Details:

Patent No Kind Lan Pg Main IPC Filing Notes
US 6856972 B1 9 G06F-017/60

Abstract (Basic): US 6856972 B1

NOVELTY - The method involves gathering data in two different 
financial statements and finding applicability of a subroutine to the 
data. The subroutine is applied to the data if it is applicable. The 
applicability of another two subroutines is found and the subroutines 
are applied, if they are applicable. Results of the subroutines are 
electronically reported to identify underlying factors which cause
changes in revenue and cost.

USE - Used for analyzing and comparing financial data for a
company.

ADVANTAGE - The method facilitates electronic reporting of the
results of the subroutines to identify factors causing changes in
revenue and cost, thus automatically analyzing the financial data. The
method reduces the company's time and expense.

DESCRIPTION OF DRAWING(S) - The drawing shows a flow chart of a
method for analyzing and comparing financial data.
Title Terms: COMPANY; FINANCIAL; DATA; COMPARE; METHOD; APPLY; THREE;
International Patent Class (Main): G06F-017/60
04740916 E.I. No: EIP97073727978

Title: Specification and management of interdependent data in operational 
systems and data warehouses 

Author: Georgakopoulos, Dimitrios; Karabatis, George; Gantimahapatruni,
Sridhar 

Corporate Source: GTE Lab Inc, Waltham, MA, USA 

Source: Distributed and Parallel Databases v 5 n 2 Apr 1997. p 121-166 

Publication Year: 1997 

CODEN: DAATES ISSN: 0926-8782 

Language: English

Document Type: JA; (Journal Article) Treatment: G; (General Review) 
Journal Announcement: 9708W4 

Abstract: (Inter) Dependent objects include data replicated or cached in 
multiple database systems, data collected and summarized in data 
warehouses for analysis, planning, and decision support, as well as any 
other category of objects whose states are related and they are maintained 
in different information systems. In this paper we discuss dependencies 
between objects in an environment consisting of operational systems and a 
data warehouse, and describe their specification and enforcement. To
specify object dependencies we introduce Object Dependency Descriptors
(ObjectDDs). These describe the relationships between dependent objects,
and define how much inconsistency between original objects and their 
replicas/collections/summaries can be tolerated before it is necessary to 
restore their consistency. Object dependencies are enforced by extended 
transactions designed specifically for evaluating if dependent objects 
satisfy their specified relationships, evaluating whether possible 
inconsistencies can be tolerated, and (if not) restoring consistency. To 
describe the transactional behavior of such consistency evaluation and 
restoration transactions we use Transaction Dependency Descriptors
(TransactionDDs). TransactionDDs define the transactional relationships
between consistency evaluation and restoration (asynchronous) transactions, 
as well as the relationships between such asynchronous transactions and 
regular (synchronous) transactions executed directly by applications. To 
automatically maintain the consistency of dependent objects, we propose the 
concept of a Dependency Management System (DMS). A DMS monitors dependent
objects, evaluates object consistency, and schedules and controls 
consistency restoration transactions to keep dependent objects within 
acceptable levels. We describe key components in the DMS architecture, and 
a relatively simple implementation involving straightforward extensions in 
a relational DBMS. (Author abstract) 42 Refs.

Descriptors: *Distributed database systems; Data structures; Data
acquisition; Data reduction; Computer operating procedures; Resource
allocation; Computer control; Computer architecture

Identifiers: Object dependency descriptors (ObjectDD); Transaction
dependency descriptors (TransactionDD); Dependency management system (DMS)

Classification Codes: 

723.3 (Database Systems); 723.2 (Data Processing); 912.2 (Management); 
723.5 (Computer Applications) 

723 (Computer Software); 722 (Computer Hardware); 912 (Industrial
Engineering & Management) 
Title: Application of machine learning in multidatabase schema
integration

Author: Azarbod, C.; Perrizo, W.

Corporate Source: Mankato State Univ, Mankato, MN, USA 

Source: Microcomputer Applications v 14 n 2 1995. p 81-90 

Publication Year: 1995 

CODEN: MIAPEZ ISSN: 0820-0750 

Language: English

Document Type: JA; (Journal Article) Treatment: A; (Applications); T; 
(Theoretical) 

Journal Announcement: 9602W4 

Abstract: The purpose of this paper is to discuss the role of incremental
concept-formation algorithms in the integration of local database schemes
developed at different sites into a global/federated database schema.
Schema integration is a major concern in Heterogeneous Distributed Data
Base Management Systems (HDDBMSs). In a HDDBMS, schema integration is the
process of discovering and representing the semantic relationships among
pre-existing databases and providing a means of resolving inconsistencies
without physically modifying or consolidating these databases. Because
understanding the real semantics of data is essential in schema
integration, we propose to use semantic data modelling as a common
foundation between participating local database schemas. All local database
schemas should be presented using the Local Semantic Data Modelling (LSDM)
format. We have extended the incremental concept-formation models developed
in the machine-learning paradigm, to build a concept-learning model that
produces concept hierarchies and a concept dictionary. This learning
model will then be used, through a series of algorithms, to integrate
(LSDM) into a Federated Semantic Data Model (FSDM). Appropriate mapping is
provided to link the local database schemas into a federated schema.
(Author abstract) 19 Refs.

Descriptors: *Learning algorithms; Distributed database systems;
Computational linguistics; Data structures; Computer simulation; Errors;
Data processing

Identifiers: Machine learning; Multidatabase schema integration; Semantic 
data modeling; Incremental concept formation 
Classification Codes: 

723.3 (Database Systems); 721.1 (Computer Theory, Includes Formal 
Logic, Automata Theory, Switching Theory, Programming Theory); 723.2 (Data 
Processing); 723.5 (Computer Applications) 

723 (Computer Software); 721 (Computer Circuits & Logic Elements)
Title: Flexible relation: an approach for integrating data from multiple,
possibly inconsistent databases

Author: Agarwal, Shailesh; Keller, Arthur M.; Wiederhold, Gio; Saraswat,
Krishna

Corporate Source: Persistence Software, San Mateo, CA, USA 
Conference Title: Proceedings of the 1995 - IEEE 11th International 
Conference on Data Engineering 

Conference Location: Taipei, Taiwan Conference Date: 19950306-19950310 
Sponsor: IEEE; National Tsing Hua University 
E.I. Conference No.: 43 044 

Source: Proceedings - IEEE International Conference on Data Engineering 
1995. IEEE, Los Alamitos, CA, USA. p 495-504 
Publication Year: 1995 
CODEN: 002 055 ISSN: 1063-6382 
Language: English 

Document Type: CA; (Conference Article) Treatment: A; (Applications); T 
; (Theoretical) 

Journal Announcement: 9507W4 

Abstract: In this work we address the problem of dealing with data 
inconsistencies while integrating data sets derived from multiple 
autonomous relational databases. The fundamental assumption in the 
classical relational model is that data is consistent and hence no support 
is provided for dealing with inconsistent data. Due to this limitation of 
the classical relational model, the semantics for detecting, representing, 
and manipulating inconsistent data have to be explicitly encoded in the 
applications by the application developer. In this paper, we propose the 
flexible relational model, which extends the classical relational model by 
providing support for inconsistent data. We present a flexible relation 
algebra, which provides semantics for database operations in the presence 
of potentially inconsistent data. Finally, we discuss issues raised for 
query optimization when the data may be inconsistent. (Author abstract) 23
Refs.

Descriptors: *Data structures; Relational database systems;
Computational linguistics; Data processing; Query languages; Interfaces
(materials); Computer hardware; Computer operating systems

Identifiers: Flexible relation; Data integration; Data inconsistencies 

Classification Codes: 

723.2 (Data Processing); 723.3 (Database Systems); 721.1 (Computer 
Theory, Includes Formal Logic, Automata Theory, Switching Theory, 
Programming Theory); 722.2 (Computer Peripheral Equipment) 

723 (Computer Software); 721 (Computer Circuits & Logic Elements); 722
(Computer Hardware) 
ADAPTIVE DETECTION OF APPROXIMATELY DUPLICATE DATABASE RECORDS AND THE
DATABASE INTEGRATION APPROACH TO INFORMATION DISCOVERY 

Author: MONGE, ALVARO EDMUNDO 
Degree: PH.D. 
Year: 1997 

Corporate Source/Institution: UNIVERSITY OF CALIFORNIA, SAN DIEGO (0033) 
Chairperson: CHARLES P. ELKAN 

Source: VOLUME 58/08-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 4315. 98 PAGES 
Descriptors: COMPUTER SCIENCE; INFORMATION SCIENCE
Descriptor Codes: 0984; 0723 

The integration of information is an important area of research in
databases. By combining multiple information sources, a more complete
view of the world is attained, and additional knowledge gained. This is a 
non-trivial task however. Often there are many sources which contain 
information about a certain kind of entity, and some will contain records 
concerning the same real-world entity. Thus, one problem in integrating
information sources is to identify possibly different designators of the 
same entity. This thesis provides solutions to this data cleansing problem. 
The integration of information sources is also proposed as an approach for 
information retrieval over the worldwide web. 

Data cleansing is the process of purging databases of inaccurate or
inconsistent data. The data is manipulated into a form which is useful for
other tasks, such as data mining. This thesis addresses the data
cleansing problem of detecting database records that are approximate
duplicates, but not exact duplicates. An efficient algorithm is
presented which combines three key ideas. First, the Smith-Waterman
algorithm for computing the minimum edit-distance is used as a
domain-independent method to recognize pairs of approximate duplicates.
Second, the union-find data structure is used to maintain the clusters of
duplicate records incrementally, as pair-wise duplicate relationships
are discovered. Third, the algorithm uses a priority queue of cluster
subsets to respond adaptively to the size and homogeneity of the clusters
discovered as the database is scanned. This results in significant savings
in the number of times that a pairwise record matching algorithm is
applied, without impairing accuracy. Comprehensive experiments on synthetic
databases and on a real-world database confirm the effectiveness of all
three ideas.
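
Searcher's illustration (not part of the cited record): the second idea above,
maintaining duplicate clusters incrementally with union-find, might be
sketched roughly as follows. This is a minimal sketch under stated
assumptions: the names UnionFind, similar and cluster_duplicates are
hypothetical, difflib's similarity ratio stands in for the Smith-Waterman
matcher, and the priority queue of cluster subsets is omitted.

```python
from difflib import SequenceMatcher


class UnionFind:
    """Disjoint-set forest with union by size and path halving."""

    def __init__(self, n):
        self.parent = list(range(n))
        self.size = [1] * n

    def find(self, x):
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return False
        if self.size[ra] < self.size[rb]:
            ra, rb = rb, ra
        self.parent[rb] = ra
        self.size[ra] += self.size[rb]
        return True


def similar(a, b, threshold=0.85):
    # ASSUMPTION: difflib's ratio is a stand-in for the Smith-Waterman
    # matcher named in the abstract; the threshold is illustrative only.
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def cluster_duplicates(records, threshold=0.85):
    """Return (clusters, number of pairwise matcher calls)."""
    uf = UnionFind(len(records))
    comparisons = 0
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            if uf.find(i) == uf.find(j):
                continue  # already in one cluster, so skip the matcher
            comparisons += 1
            if similar(records[i], records[j], threshold):
                uf.union(i, j)
    clusters = {}
    for i, rec in enumerate(records):
        clusters.setdefault(uf.find(i), []).append(rec)
    return list(clusters.values()), comparisons
```

Because cluster membership is transitive, a pair whose records already share a
cluster root never reaches the matcher, which is one plausible reading of the
savings in pairwise comparisons that the abstract reports.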

This thesis also presents WebFind, an application that discovers
scientific papers online over the worldwide web. The size of the web makes
online information retrieval difficult. In such a setting, it is critical
to know where to concentrate the search for information. WebFind uses
a domain-independent algorithm to match records from different sources for
the purpose of integrating the information in those sources. We describe
the design of WebFind, the integration process, and discovery phase.
DECOMPOSITIONS OF JOIN DEPENDENCIES IN THE RELATIONAL DATABASE MODEL
Author: GYSSENS, MARC
Degree: DR. 
Year: 1985 

Corporate Source/Institution: UNIVERSITAIRE INSTELLING ANTWERPEN 
(BELGIUM) (0314) 

Source: VOLUME 48/05-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 1415. 189 PAGES 
Descriptors: COMPUTER SCIENCE 
The decomposition of a relational database has been studied
extensively during the last fifteen years. The reasons for decomposing a 
relation are obvious: smaller relations are easier to understand, 
independent data should not be stored in the same relation and in 
distributed databases, different components can be stored in different
sites. The main tool for decomposing a relational database is the 
specification of semantic constraints. The study of these constraints has 
started about 1970 with the introduction of functional dependencies by 
Codd. Many other types were proposed afterwards. This work is mainly 
concerned with join dependencies, since they are a necessary and
sufficient condition for a relation to be decomposable. Moreover, many
authors believe that the structure of a real-world database can be
expressed by one join dependency and some functional dependencies. 
Unfortunately, the join-operator is very expensive. Therefore we devise an 
algorithm to decompose a join dependency into a set of smaller join 
dependencies as to make integrity checking more efficient. It is shown that 
this algorithm generates final non-redundant decompositions satisfying
several favorable properties. As said before, a join dependency can be used 
to decompose a database. If this approach is followed, we need to check the 
consistency of the database after each update. Until recently it was 
believed that this could only be done efficiently for acyclic join 
dependencies and hence all research efforts were concentrated on how to 
modify the structure of a database so that it can be described by an 
acyclic join dependency. As an important side-effect of the decomposition 
algorithm mentioned above, a hierarchical classification of join 
dependencies is established according to their degree of cyclicity n, for 
which acyclicity corresponds to n = 2. It turns out that most problems
concerning join dependencies remain tractable if the degree of cyclicity is 
restricted, but not necessarily to acyclicity. Hence the major conclusion 
is that in order to design the structure of a database, one must first 
decide which level of cyclicity one considers acceptable and then use only 
join dependencies whose degree of cyclicity does not supersede this level. 
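
Searcher's illustration (not part of the cited record): the statement that a
join dependency is a necessary and sufficient condition for a relation to be
decomposable can be checked directly on small data, since a relation
satisfies the dependency exactly when the natural join of its projections
reproduces the relation. The following is a minimal sketch of that brute-force
check only; the function names are hypothetical, and it is not the
decomposition algorithm of the thesis.

```python
from functools import reduce


def project(relation, attrs):
    """Project a set of rows (frozensets of (attr, value) pairs) onto attrs."""
    projected = set()
    for row in relation:
        values = dict(row)
        projected.add(frozenset((a, values[a]) for a in attrs))
    return projected


def natural_join(left, right):
    """Natural join: combine rows that agree on every shared attribute."""
    joined = set()
    for l_row in left:
        l_vals = dict(l_row)
        for r_row in right:
            r_vals = dict(r_row)
            shared = l_vals.keys() & r_vals.keys()
            if all(l_vals[k] == r_vals[k] for k in shared):
                joined.add(frozenset({**l_vals, **r_vals}.items()))
    return joined


def satisfies_join_dependency(rows, components):
    """True if joining the projections onto `components` gives back `rows`.

    `rows` is a list of dicts; `components` is a list of attribute tuples
    that together cover every attribute of the relation.
    """
    relation = {frozenset(r.items()) for r in rows}
    projections = [project(relation, attrs) for attrs in components]
    return reduce(natural_join, projections) == relation
```

The nested loops in natural_join make the cost of this check grow quickly with
relation size, which is consistent with the abstract's remark that the join
operator is expensive and with its motivation for splitting one join
dependency into smaller ones.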
Abstract: Whenever a database permits textual entry of information, for
example when data is copied from a paper form, the database is likely to
contain duplicates and inconsistencies. These duplicates must be
removed and inconsistencies resolved in order to mine the data or to use
the data for decision support. We term the domain-specific solution to
duplicate and inconsistency removal data standardization. In this
paper, we describe a Name-Address Standardizer, one of a series of
standardizers that have proven critical in creating a new enterprise-level
database for the U.S. Customs Service. The standardizers were used to clean
several legacy databases. These standardized databases were
combined into a central database for which data is now standardized upon
input. In practice, a standardizer uses techniques both from natural
language analysis and from rule-based expert systems. As a result Prolog is 
highly suitable as a basis for standardizers. All Customs standardizers 
were written almost entirely in Prolog and constitute a large programming 
effort: the Name-Address Standardizer contains about 100,000 lines of code, 
including generated parse tables and a fact base. (6 Refs) 
Abstract: As database technology advances rapidly, there are usually
heterogeneous databases (hierarchical, relational or object-oriented
databases) used by different departments in an enterprise. On the other
hand, the object-oriented technology is becoming the dominant application
development paradigm. Thus, there is a data modeling gap between
applications and heterogeneous databases. In this paper, we present
different architectural aspects of the system VHDBS (Verteilte Heterogene
Datenbanksysteme = Distributed Heterogeneous Database System), which has
been developed in an ongoing research project. This architecture provides a 
way to support cooperative access to distributed heterogeneous databases 
and to fill the data modeling gap in a distributed heterogeneous 
environment. (12 Refs) 
Abstract: The paper identifies various problems in integrating schema,
data and methods in heterogeneous systems and describes the strategies used
in Pegasus to deal with them. Pegasus is an object-oriented multidatabase
system that is being developed at Hewlett-Packard Laboratories. The goal of
the first running prototype is to provide facilities for multidatabase
applications to access and manipulate multiple autonomous heterogeneous
object-oriented and relational databases. Pegasus provides both type
and procedural abstractions to support integration. Type abstraction is
used to organize objects from multiple databases in categories that are
related by having some common properties. The common properties of objects
in different databases may be in incompatible forms or in inconsistent
states. The system provides procedural abstraction that enables the schema
integrator to develop arbitrary mappings between data from different
sources and to write necessary transformations to convert object properties
to a common form. Because of the importance of existing relational
databases, some important integration techniques are demonstrated through
the object-oriented views over SQL databases. (21 Refs)
Abstract: In distributed database systems, views independent of
heterogeneity and distribution of database systems have to be provided. The
authors adopt a Prolog-like system as the common interface to the database
systems. Although many researchers have tried to integrate multiple
database systems at the higher level like the relational model, the authors
integrate them at the lower, navigational level. Because every database
system provides the navigational interface, no additional overhead is added
to the database systems. They discuss how to get an efficient access
program from the nonprocedural Prolog-like query on the navigational
database systems like conventional network database systems, Unix file
systems, and even relational database systems. Conventional
optimization methods aim at decreasing the number of access units, i.e.
pages, records. However, the authors' method aims at reducing not only the
number of access units but also the number of redundant answers. Also, the
access program is executed in parallel. (20 Refs)
Semantic heterogeneity has been investigated in connection with the
database and data dictionary integration efforts that support Command, 
Control, Communications and Intelligence (C3I) systems. Based on this 
investigation, a systematic approach to the resolution of semantic 
heterogeneity has been developed and illustrated with examples derived from 
the component C3I system databases in a federation. A methodology is 
introduced for resolving semantic conflicts to construct a tightly coupled 
federated database system by facilitating the development of a global 
schema derived from the individual schemas of the component databases. This 
methodology of resolving semantic conflicts results in the formation and 
modification of synonym-homonym groups (SHG), a concept introduced and
developed in the paper. A detailed analysis using a three-phased procedure 
is introduced, with each phase exploring semantic heterogeneity at 
progressively finer levels of information granularity. For the purpose of
illustration, the simplest case of a two-component database integration 
into a tightly coupled federated database system is considered, but the 
methodology can be generalized to include three or more component databases 
in a federated system. It also can be applied in the case of a fully merged 
database integration involving any number of component databases. A
variety of inconsistencies were identified using the heuristics
implemented in the algorithm. Resolutions of the problems arising from 
semantic heterogeneity are suggested, and directions for future research 
are explored. 
We introduce an abductive method for coherent composition of distributed
data. Our approach is based on an abductive inference procedure that is
applied on a meta-theory that relates different, possibly inconsistent,
input databases. Repairs of the integrated data are computed, resulting
in a consistent output database that satisfies the meta-theory. Our
framework is based on the A-system, which is an abductive system that
implements SLDNFA-resolution. The outcome is a robust application that, to
the best of our knowledge, is more expressive (thus more general) than any 
other existing application for coherent data integration. 

English Descriptors: Inference; Abduction; Logical programming; Knowledge 
base; Relational database 

French Descriptors: Inference; Abduction; Programmation logique; Base 
connaissance; Base donnee relationnelle 



Classification Codes: 001D02B07D 



Copyright (c) 2002 INIST-CNRS. All rights reserved. 



File 256:TecInfoSource 82-2005/Feb
(c) 2005 Info. Sources Inc 

? ds 

Set   Items   Description

S1    10555   DATABASE? OR DATABANK? OR DATA()(BASE? OR BANK?) OR DB OR
              RDB OR DBMS OR OODB OR ORACLE OR SYBASE OR SQL OR
              DATA()(BASE? OR BANK?)

S2    869     S1(4N)(MULTIPLE OR MULTIPLICITY OR PLURAL OR PLURALITY OR
              MIRROR? OR MANY OR SEVERAL OR DIFFERENT OR VARIOUS OR VARIETY)

S3    9164    COMBINE? OR COMBINING OR CONSOLIDAT? OR DISTRIBUTED OR
              SHARED OR NETWORKED OR GROUPWARE?

S4    1609    REDUNDANT? OR USELESS OR DUPLICAT? OR DEDUP? OR WASTED OR
              INCONSIST? OR IRRELEVAN? OR GAP OR GAPS

S5    601     GLOSSAR? OR DICTIONAR? OR LESICOG? OR LEXICON? OR THESAURI?
              OR WORDLIST? OR WORD()(LIST OR LISTS)

S6    255     DATASTRUCTUR? OR DATA()(STRUCTUR? OR LINK? OR TEMPLAT? OR
              DESIGN? OR SET OR SETS)

S7    9963    UPDAT? OR UP()(DATE? OR DATING) OR VERSION? OR REVIS?

S8    637     (LEGACY OR OLDER? OR PRIOR? OR DEFUNCT? OR RELATIONAL)(3N)S1

S9    8       S2(S)S3(S)S4

S10   9       S3(S)S4(S)S8

S11   4       (S9 OR S10)(S)(S5 OR S6 OR S7)



11/3,K/1

DIALOG(R)File 256:TecInfoSource
(c) 2005 Info. Sources Inc. All rts. reserv.

01099368 DOCUMENT TYPE: Product 

PRODUCT NAME: MarketGenius Suite (099368) 

Anchor Software LLC (598275) 
730 E Park Blvd #104 
Plano, TX 75074 United States
TELEPHONE: (972) 881-2424 

RECORD TYPE: Directory 

CONTACT: Sales Department 

REVISION DATE: 20020830 

...s FASTForward (TM) system. FASTForward offers change of address (COA)
features that streamline mailing list updates. Addressforward includes
automated job queue features that improve throughput. The AddressVerifier
module offers real-time updating and verification of address information.
It can be embedded in Internet, online call center, point-of-sale (POS), or
batch processing applications. MarketGenius's MaxDup+ eliminates duplicate
records. It includes consumer and business merge and purge features. The
MaxConvert+ component converts multiple...

...and integrate with Microsoft Excel and other spreadsheet applications.
The program can evaluate information across multiple time periods.
MarketGenius's Database Wizard creates, consolidates, and updates
databases, employing Anchor Software's Image Crusher (TM) technology to
eliminate duplicates. The module lets users define update actions and
key relationships. Finished tables can be referenced by any SQL query
engine, providing...
...own development methodology, moving our development effort from the
mainframe to the workstation, developing a relational database
perspective, replacing duplicate, free-standing databases with a shared
data environment and installing CASE tools that would take us to
application generation directly from...
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TEXT: 

You heard it here first - Windows is dead. Operating systems (including 
Windows) have been made irrelevant by cross-platform middleware such as 
relational databases from Oracle, Sybase, and Informix, groupware 
like Notes, and Domino from Lotus, and programming environments like Java 
from Sun Microsystems. Am. . . 
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... accessing the same data at the same time you won't have to worry 

about inconsistent results occurring when parallel work takes place on 
different base data sets. Essbase also consolidates data from 
multiple spreadsheet sources and relational database systems, which 
increases user productivity by making the data available as a dynamic 
resource, instantaneously. . . 
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ABSTRACT: Object-oriented programming can be used to consolidate bulky 
corporate databases developed by different business units. These 
databases often contain redundant information, such as customer data 
stored in the sales, accounting and marketing departments. Problems of data 
inconsistency can also be resolved by developing reusable data 
structures, or objects, which represent data fields shared throughout 
the corporation. These objects are defined in object programming tools such 
as Borland International... 
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. . .ABSTRACT: SQL Server and Oracle databases, rather than limiting them to 
host operations through IBM's Distributed Relational Database 
Architecture (DRDA) . DataHub software lets database managers duplicate 
and move data on certain databases, run utilities, authorize users and 
examine database structures. The... 
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517 41 S16 OR S14 

S18 37 S17 NOT AD>20010326 

S19 37 IDPAT (sorted in duplicate/non-duplicate order) 

S20 36 IDPAT (primary/non-duplicate records only) 
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...SPECIFICATION access one of the copies of the file. The Coda system also 
assumes that the version stored in the client's cache is the correct 
version , so situations in which both the original and the duplicate were 
altered are not properly. . . 

...File System) . Coda provides no solution to the more general problem of 
detecting and resolving inconsistencies in a distributed database 
that can include objects other than file and directory descriptors. 

Various approaches to distributed database replication attempt to 
ensure consistency between widely separated replicas that collectively 
form the database. Examples... 
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A method and apparatus for detecting and analyzing errors in a 
communications system is described. The method employs expert system 
techniques to isolate failures to specific field replaceable units and 
provide detailed messages to guide an operator to a solution. The expert 
system techniques include detailed decision trees designed for each 
resource in the system. The decision trees also filter extraneous sources 
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...SPECIFICATION word with bit 14 set to 0 specifies the end of the table. 
A period word specifies the length of the on or off pulse and is 
calculated as: 

MS/0... the current node address. Otherwise add the failed offset to the 
current node address. Then, update the data base record to this new 
address. Continue by using the node type to... through the Hardware Error 
Table (ERRH) and the Failing Resource Table (FRT) . 

Each FRT is updated when a change occurs. The failing resource 
manager also performs priority testing after the system. .. resources, the 
resource is taken out of service, and the entry in the FRT is updated 
to reflect the removal. Once the resource is removed from service, the 
same series of . . . 

. . .When the resource is returned to service, the threshold alarm software 
is informed via an update to the FRT of the return to service of the 
failing resource. 
Multiple Failing Channels... 

. . .An entry is made to the FRT and the out of service state is also 

updated in the ERRH. In other words, there is always a failing resource 
record in the . . . 

...the failure alarms of resources. The two pieces of information given by 
FRM are the update -count (FR. . . 

. . . UPDATE . . . UPDATE 



.card type by one whenever the failure alarm of the card type needs to be 
updated . TA is kicked into action when it observes a non-zero FR. . . 
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English Abstract 

In the distributed enterprise application integration system, modularized 
components located on multiple hosts are centrally managed so as to 
facilitate communication among application programs. Collaboration 
services traditionally associated with a central server, such as, for 
example, message queues, message publishers/subscribers, and message 
processes, are instead distributed to multiple hosts and monitored by a 
central registry service. This system allows configuration management to 
be performed in a central location using a top-level approach, while 
implementation and execution tasks are distributed and delegated to 
various components that communicate with the applications. 
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French Abstract 

On decrit un systeme, un procede et un article manufacture qui 
constituent une structure de chaine d'approvisionnement fondee sur le 
reseau. L'installation d'un service est geree au moyen d'un reseau. La 
demande et l'approvisionnement des offres de fabricant sont planifies au 
moyen du reseau et les commandes relatives aux offres du fabricant sont 
egalement gerees au moyen du reseau. Le reseau est egalement utilise pour 
gerer les actifs sur le reseau, y compris pour effectuer la maintenance 
et le service pour les actifs de reseau au moyen du reseau. 
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Detailed Description 

Detailed Description 
... a 

preferred embodiment of the present invention; and 

Figure 52 is a flowchart showing a Data Mining Process in accordance 
with a preferred embodiment of the present invention. 

Figure 53 is a. . .1 1 

can share the rapid technical advantages of packet technologies, and 
improve their cost structure , and at the same time offer new services 
on the "Next Generation Network" . 
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A universal data and software structure and method for an Any-to-Any 
computing machine in which any number of any components can be related to 
any number of any other components in a manner that is not intrinsically 
hierarchical and is intrinsically unlimited. The structure and method 
includes a Concept Hierarchy; each concept or assembly of concepts is 
uniquely identified and assigned a number in a Numbers Concept Language 
or uniquely identified in a Non-numbers Concept Language. Each Component 
or assembly of Components is intrinsically related to all other data 
items that contain common or related components. 

French Abstract 

L'invention concerne une structure de donnees et de logiciel universelle 
ainsi qu'un procede de machine informatique toute categorie dans laquelle 
des composants, quels qu'ils soient et quel que soit leur nombre, peuvent 
etre rattaches a d'autres composants, quels qu'ils soient et quel que 
soit leur nombre, d'une maniere intrinsequement non hierarchisee et 
intrinsequement illimitee. La structure et le procede comportent une 
hierarchie conceptuelle; chaque concept ou ensemble de concepts est 
identifie de maniere unique et recoit un numero dans un langage 
conceptuel de nombres ou dans un langage conceptuel de non-nombres. 
Chaque composant ou ensemble de composants est intrinsequement rattache a 
tous les autres elements de donnees qui contiennent des composants 
communs ou associes. 
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Claims 

Claim 

they can and can not do, using the "Able" field or record type in the 
Data Relation Table. Thus, in the nonsense example given at the 
beginning 'e-mail the modem. . .The user will reply 'yes' 

Further routines that are the province of associated software can revise 
the definitions of 'telephone' and 'roam'. It can also be envisaged that 
software can be... be singular or plural - all the letters from Joe about 
bananas' for example. A 'Unique, Data specification is unique not in 
the sense that it unique means 'one' item, but 'unique... of data that 
need to be recorded in order to ensure a human's Unique data 
Specification can be met - if implemented in the appropriate software. 9 
Concept Language Requirements. C. . .does in the state of the art, a list 
that is impossibly long and therefore useless, as well as taking an 
inordinate time to execute. 

However, if the search is... a Language X Concept 

Language - i.e. a Concept Language that is a Concept Language version 
of that same language. Do this by following the steps listed and 
explained below. This... be described 
later) : 

1 1) Data Entry Compression Coding. The nature of the type of data 
entry 

(Command, Command query, or Data for recording) can change the meaning of 
the word ... Symbols or Statements in earlier steps. The difference 
between the contraction and the un-contracted versions is the relative 
formality of the two. This can be useful if the Formality Data . . . to 
record all Data Class values that are available for each computer event 
in a database field of their own. Then, a simultaneous selection of a 
number of values from different . . . this means that a particular appears to 
classify in another data Category also, but the different meaning for 
the other data Category is semi-invisible and hard to detect. A prime . A 
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English Abstract 

A system, method, and article of manufacture are provided for managing 
constants in a computer program. A plurality of constant names are 
provided. Each of the constant names has a corresponding constant value. 
The constant names are grouped into constant classes based on an entity 
which the constant values represent. Access is allowed to the constant 
values by receiving a call including the corresponding constant name and 
corresponding constant class. 
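
The lookup described in this abstract (a call that supplies both a constant name and a constant class) can be sketched in Python. This is only an illustration under stated assumptions: the `ConstantRegistry` class, its `define` and `get` methods, and the sample classes `OrderStatus` and `AccountType` are hypothetical names that do not come from the patent record.

```python
class ConstantRegistry:
    """Groups named constants into classes keyed by the entity they describe.

    Hypothetical helper for illustration; not taken from the patent text.
    """

    def __init__(self):
        self._classes = {}

    def define(self, constant_class, name, value):
        # Create the class bucket on first use, then store the constant.
        self._classes.setdefault(constant_class, {})[name] = value

    def get(self, constant_class, name):
        """Look a value up by class and name; raises KeyError if unknown."""
        return self._classes[constant_class][name]


registry = ConstantRegistry()
registry.define("OrderStatus", "OPEN", 1)
registry.define("OrderStatus", "CLOSED", 2)
registry.define("AccountType", "OPEN", "O")  # same name, different class

open_order = registry.get("OrderStatus", "OPEN")
open_account = registry.get("AccountType", "OPEN")
```

Because the class is part of every lookup, the same constant name can carry a different value for each entity without the two colliding.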

French Abstract 

L'invention porte sur un systeme, un procede et un article de gestion des 
constantes d'un programme d'ordinateur. On etablit les noms de 
differentes constantes a chacun desquels correspond la valeur d'une 
constante, puis les noms sont regroupes par classes de constantes en 
fonction d'une entite representant les valeurs des constantes. L'acces a 
une valeur de constante est autorise lors de la reception d'un appel 
comprenant le nom et la classe de la constante correspondante. 
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Fulltext Availability: 
Detailed Description 
Detailed Description 

. . . the naming service allowing access to a plurality of different sets of 
services from a plurality of globally addressable interfaces in 
accordance with an embodiment of the present invention; Figure 101... 

. . .an activity module in accordance 

with an embodiment of the present invention; 

Figure 124 illustrates multiple interfaces to an application including 
a handheld device, a desktop 

PC, and a telecommunications device ... invention; 

Figure 163 illustrates a flowchart for a method for organizing data 
access among a plurality of business entities in accordance with an 
embodiment of the present invention ; 

Figure 164 illustrates .. .variable) throughput QoS based on available 
network capacity.) The following list provides a description of various 
Quality of Service parameters. 

179 

connection establishment delay - time between the connection request and 
...network. The Web server typically provides an option to verify whether 
the page has been updated since the time it was placed in the cache, 
and if it has to get the latest update . 
Possible Product Options 

Netscape Enterprise Web Server; Microsoft Internet Information Server 
(IIS); Oracle Webserver The. . . 



. . .219 

A free add-on product for NT Server that implements basic HTTP services. 
Future versions of NT Server (4.0 and beyond) will have HTTP features 
built directly into the... 



...or relevant information is available. 



Asynchronous push/pull services can be useful for pro-actively updating 
customers on changes in order status or delivering information on new 
products or services they. . . 

.paid to performance as batch systems usually must be processed within 
strict batch windows. 



The design of batch architectures is often complicated considerably by 
the fact that batch jobs must be... from SQL Solutions. SQR is a robust 
report generator designed to be used with SQL-based relational 
databases. SQR insulates the developer from programming in a third 
generation language by providing a higher. . .a previously generated 
report. 

227 

All application-defined report writer modules invoke an API to update 
the report status table with a status of "completed" after a report has 
been produced. . . 

.module generates the report, prints it if specified in the original API 
request, and updates the status in the report status table. 

A request to print a report proceeds as ... specific report sections). 

Search capabilities (allows users to search report for occurrence of a 
specific data stream) . 

10. Report Level Security: Reports may occasionally contain sensitive 
information. It is therefore . . . it possible to accommodate a 
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satisfy new business requirements new product line solely by updating 
by replacing or modifying certain the Product component, 
components with minimal impact 
to others . 

Adaptable . . . 

.new user interface while reusing 

the core application, existing components. 

Maintainable: Making it easy to update an application by reducing the area of . . . 
Making it easy to add a new customer. . . 
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Detailed Description 
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services from a plurality of globally addressable interfaces in 
accordance with an embodiment of the present invention; Figure 101... with 
other design and development tools, presentation services (graphics, 
multi-media, etc.), data access services (databases and database API 
libraries), distribution services (distributed TP monitor), transmission 
services (SNA, HLLAPI, etc.), data ... variable) throughput QoS based on 
available network capacity.) The following list provides a description of 
various Quality of Service parameters. 

connection establishment delay - time between the connection request and 
a confirm. . .The report architecture should allow preview of reports 
online from a user's intelligent workstation prior to actual 
distribution. Ideally, the report architecture itself would provide 
support for online preview of... option is to have a system administrator 
or the user physically install new applications and update existing 
applications on each client machine. 
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User accessible database organizing method for minimizing duplicate 
information amount, involves using data retrieval components for 
accessing index value and data entries 

Patent Assignee: BULL HN INFORMATION SYSTEMS INC (HONE ) 

Inventor: BEAUCHESNE R C 
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Abstract (Basic) : US 6128626 A 

NOVELTY - A product directory index and data table structures for 
storing number of respective index and data entries are stored in 
tables (200-1-200-4) in database (200) . A set of data retrieval 
components are provided in database for accessing index value and data 
entries and database is accessed via multiple client system to 
generate bill of material document for board assembly products. 

DETAILED DESCRIPTION - The product directory index table structure 
storing index value entries coded in preset manner representing all 
board assembly products being currently manufactured is stored in a 
first table in database. Each index value entry contains key values 
having file key value coded for extracting data entries from other 
tables contained in database including information related to 
particular board assembly product and file version value coded for 
designating points within the database table where entries are added or 
deleted. Data table structures storing data entries containing 
different assembly board product related information used for 
generating bill of materials (BOM) documents used in manufacturing 
board assembly products is stored in other tables in the database. Each 
entry contains key values defining data selection criteria used in 
extracting entries utilized by particular board assembly product. The 
key values in each data entry includes ON and OFF field values which 
are set for identifying version of board products with and without 
data entry. The ON and OFF field values are used along with file 
version value for defining data selection criteria. The database is 
accessed via multiple client system having copy of data retrieval 
components to generate BOM document for board assembly products. An 
INDEPENDENT CLAIM is also included for software database system. 

USE - For organizing database for minimizing storage amount of 
duplicate information used for generating bill of material documents 
for manufacturing printed circuit board assembly products. 

ADVANTAGE - Enables operator or user to access database for 
obtaining information pertaining to particular PCB assembly. 

DESCRIPTION OF DRAWING (S) - The figure shows components of database 
data retrieval software. 

Database (200) 

Tables (200-1-200-4) 

pp; 32 DwgNo 2/6 
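
The ON/OFF selection criteria in the abstract above can be illustrated with a short Python sketch. It rests on an assumption the abstract does not state: that an entry applies to file version `v` when `ON <= v < OFF`, with OFF left unset for entries still in use. The names `DataEntry` and `select_entries` and the sample parts are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DataEntry:
    file_key: str              # ties the entry to one board assembly product
    part: str                  # hypothetical payload, e.g. a BOM line item
    on: int                    # first file version that includes the entry
    off: Optional[int] = None  # first file version that drops it (assumed)


def select_entries(entries, file_key, version):
    """Return the entries that belong to one product at one file version."""
    return [
        e for e in entries
        if e.file_key == file_key
        and e.on <= version
        and (e.off is None or version < e.off)
    ]


ENTRIES = [
    DataEntry("BRD-A", "resistor R1", on=1),
    DataEntry("BRD-A", "capacitor C1", on=1, off=3),
    DataEntry("BRD-A", "capacitor C2", on=3),
    DataEntry("BRD-B", "connector J1", on=1),
]

bom_v2 = [e.part for e in select_entries(ENTRIES, "BRD-A", 2)]
bom_v3 = [e.part for e in select_entries(ENTRIES, "BRD-A", 3)]
```

Under this reading, one stored row with version bounds serves every product version it applies to, which is one way the amount of duplicate information could be kept down.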
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Replication tracking apparatus for management of distributed data 
processing system - employs tracking function that checks duplication 
of shadowed databases between different servers is synchronised and 
that contents of databases with same ID are identical 
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Abstract (Basic) : GB 2327781 A 

The replication tracking apparatus operates within a distributed 
system monitor (DSM) server (10) connected via a network to several 
application servers (40-70). The DSM server has a database replication 
tracking function. Each application server has a database application 
(e.g. Lotus Notes (RTM) ) with several executable data management tasks, 
e.g. Notes replicator task. 

As replication tasks are carried out by each application server the 
DSM server uses its tracking function to check the tasks are 
synchronised in each application server. If they are out of 
synchronisation, the DSM server sends an alert to the administration 
terminal. The tracking function also verifies that databases with the 
same replica ID have the same contents. 

USE - In distributed network of several servers and clients 
running e.g. Lotus Notes (RTM) where replication tasks are 
required when data stored in different databases must be shadowed 
between different application servers. 

ADVANTAGE - Ensures replication of databases in server is carried 
out at correct frequency to accommodate regularity with which end 
clients modify them. 
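
The content check mentioned in this abstract (databases with the same replica ID should have the same contents) could be sketched as follows. This is a minimal illustration, not the patented tracking function: comparing SHA-256 digests of sorted records is an assumed technique, and `content_digest`, `find_mismatched_replicas`, and the server names are hypothetical.

```python
import hashlib
from collections import defaultdict


def content_digest(records):
    """Hash a database's records independent of their storage order."""
    h = hashlib.sha256()
    for rec in sorted(records):
        h.update(rec.encode("utf-8"))
        h.update(b"\x00")  # separator so ("ab", "c") differs from ("a", "bc")
    return h.hexdigest()


def find_mismatched_replicas(databases):
    """databases: iterable of (server, replica_id, records) tuples.

    Returns the replica IDs whose copies do not all have identical contents.
    """
    digests = defaultdict(set)
    for _server, replica_id, records in databases:
        digests[replica_id].add(content_digest(records))
    return sorted(rid for rid, seen in digests.items() if len(seen) > 1)


snapshot = [
    ("server40", "R1", ["alice", "bob"]),
    ("server50", "R1", ["bob", "alice"]),        # same contents, other order
    ("server40", "R2", ["order-1"]),
    ("server60", "R2", ["order-1", "order-2"]),  # this copy has diverged
]
alerts = find_mismatched_replicas(snapshot)
```

A monitor built along these lines would raise an alert for each replica ID returned, here only `R2`.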
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Database management system using smoke duplicate database facility - 
comprises multiple remote computers with backup database in which 
updated data from local computer are maintained synchronisingly 
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Abstract (Basic) : US 5799323 A 

The system comprises a local computer (110) which is connected with 
multiple interconnected remote computers (122,162) through a 
communication channel. In each local computer, a local database is 
maintained which is updated by an application program. The modified 
application program is maintained as the audit record in a master audit 
trail by a transaction manager. The modified application program record 
is extracted by an independent extractor processor (130) and 
transmitted to the remote computer in an asynchronous mode. 

Based on the received audit record, the backup database in the 
remote computer is updated. When the audit record is received in one 
computer, the copying instruction along with the backup database 
synchronization signal is transmitted to other remote computers. Based 
on the synchronization signal, the updated backup database is copied 
into other databases of the remote computer and maintained. 

ADVANTAGE - Enables data utilisation report backup database for 
report generation and inquiry task. Improves resource data utilisation 
efficiency. Enables synchronization of all backup databases in remote 
computer . 
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The apparatus includes a search condition input unit (101) through 
which a user inputs a search request. This request is input to multiple 
searching units (105) through a parallel convertor (104). A search 
monitoring unit (108) monitors the searching units as they retrieve 
information from multiple databases (107). 

The information extracted by each searching unit is combined and 
then displayed in a particular format by an information formatting unit 
(104) and a standard structure extractor. If two items of information 
retrieved from the databases are found to be identical, one of them is 
deleted by a redundant data searching unit (115). 

ADVANTAGE - Reduces waiting time during searching of information 
from multiple databases. 
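
The flow in this abstract (one request fanned out to several searching units, results combined, identical hits dropped) can be sketched in Python. The thread pool stands in for the parallel convertor and the `seen` set for the redundant data searching unit; both are assumptions made for illustration, and `search_one`, `search_all`, and the sample data are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def search_one(database, keyword):
    """One 'searching unit': scan a single database for the keyword."""
    return [row for row in database if keyword in row]


def search_all(databases, keyword):
    """Fan the request out in parallel, then merge and drop identical hits."""
    workers = max(1, len(databases))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        per_db = list(pool.map(lambda db: search_one(db, keyword), databases))
    merged, seen = [], set()
    for hits in per_db:          # pool.map returns results in input order
        for hit in hits:
            if hit not in seen:  # redundant-data check
                seen.add(hit)
                merged.append(hit)
    return merged


DB_A = ["database replication", "query optimizer"]
DB_B = ["database replication", "database index"]
results = search_all([DB_A, DB_B], "database")
```

The hit found in both databases appears once in `results`, and the total wait is bounded by the slowest database instead of the sum of all of them.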
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Hierarchical data distribution system for distributed database - has 
top level master database system, bottom level client servers, each with 
its own database copy and intermediate database level 
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Abstract (Basic) : US 5721914 A 

The data distribution system (HDDS) includes a top level master 
database, bottom level client servers, each with its own database copy, 
and at least one intermediate database level. The entry of update 
data into the system invokes the distribution process. First, the 
master database system is updated; then the master database system 
updates several database systems at the first intermediate 
database level. Each database system at the first intermediate 
database level then updates several database systems at the next 
lower database level. 

The distribution process performs any necessary reformatting, data 
assembly and data view processing before transmitting the update 
data. Each higher level database system must update fewer lower level 
servers and overall update performance is improved. If communications 
to a destination server are not functional, the distribution system 
detects this immediately because it is unable to establish 
communications with the destination server. 

ADVANTAGE - Transitional inconsistency is reduced because the 
distribution process is invoked for all destination servers at the same 
time. System resources are not wasted because communications are only 
established when there is data to be updated . 
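
The cascade described in this abstract (master first, then each level updating several systems at the next lower level) can be sketched as a tree walk. This is a simplified, sequential illustration: the abstract says the process is invoked for all destination servers at the same time, which the sketch does not attempt to model. The `Node` class and the node names are hypothetical.

```python
class Node:
    """A database system at some level of the hierarchy (hypothetical)."""

    def __init__(self, name, children=()):
        self.name = name
        self.children = list(children)
        self.data = {}

    def apply_update(self, update, log):
        """Update this node first, then push the update one level down."""
        self.data.update(update)
        log.append(self.name)
        for child in self.children:
            child.apply_update(update, log)


clients = [Node(f"client{i}") for i in range(4)]
mid_a = Node("mid_a", clients[:2])
mid_b = Node("mid_b", clients[2:])
master = Node("master", [mid_a, mid_b])

order = []
master.apply_update({"price": 42}, order)
```

With four clients and two intermediate systems, the master contacts two servers instead of four, which is the fan-out saving the abstract attributes to the hierarchy.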
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ABSTRACT 



PROBLEM TO BE SOLVED: To provide the short-time processor while maintaining 
partial data consistency as to tables in mutually different table structure 
between an existent operation system RDBMS (relational data base 
management system) and different kind of DBMSs by using a replication 
mechanism without developing an additional function for the RDBMS. 

SOLUTION: The processor comprises a judging and extracting function part 
23, a duplicating function part 24, an SQL statement converting process 
function part 25, and an SQL statement loading function part 25 and then 
differences of final results of records are extracted from a user 
definition table 22, a temporary table 14, and an update log table 15 of 
the relational data base management system 11; and extracted data are 
changed and processed into SQL statements, which are loaded. 



COPYRIGHT: (C) 2000, JPO 



19/5/39 (Item 39 from file: 347) 

DIALOG (R) File 347:JAPIO 

(c) 2005 JPO & JAPIO. All rts. reserv. 

05121553 **Image available** 

DEVICE AND METHOD FOR REFLECTING DUPLICATED DATA BASE 



PUB. NO.: 08-077053 [JP 8077053 A] 
PUBLISHED: March 22, 1996 (19960322) 
INVENTOR(s): MORISHITA SHINJI 
KOBAYASHI NOBUYUKI 
APPLICANT(s): NIPPON TELEGR & TELEPH CORP <NTT> [000422] (A Japanese 
Company or Corporation), JP (Japan) 
APPL. NO.: 06-215125 [JP 94215125] 
FILED: September 08, 1994 (19940908) 
INTL CLASS: [6] G06F-012/00; G06F-017/30 
JAPIO CLASS: 45.2 (INFORMATION PROCESSING -- Memory Units); 45.4 
(INFORMATION PROCESSING -- Computer Applications) 



ABSTRACT 

PURPOSE: To enable duplication processing between duplicated data 
bases which are different in physical structure with a low load by 
using information to which logical position information used for update 
processing is added as reflection information sent from a duplication 
source data base device. 

CONSTITUTION: This device has the duplication source data base 100a 
having a reflection information generating means 110a which generates and 
outputs the reflection information by after- update information and 
logical position designation information obtained at the time of data base 
update processing and a duplication destination data base device 100b 
having a reflection update means 110b which performs reflection 
processing for a data base on the basis of the reflection information 
generated by the duplication source data base device 100a. Consequently, 
the duplication destination data base device 100b can instruct update 
to a data base management system directly without using an application 
program for update and reflection. Then both the duplication source 
data base device 100a and duplication destination data base device 100b 
can evade the load of processing. 
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Abstract (Basic) : US 20030061226 Al 

NOVELTY - A data loader (105) transfers data from existing data 
sources (113) to specified data objects. Several structural hierarchies 
created by data loading wizards, are loaded with information and the 
data objects within the hierarchies are linked. The data loader 
verifies the validity of the data items and the invalid data are 
transferred for verification into a predetermined storage area. 

USE - For providing enterprise business intelligence, data analysis 
and data warehousing services. 

ADVANTAGE - Simplifies and automates the process of data loading, 
updating and maintaining the data contents at low cost and time. The 
data loading tasks are scheduled to run automatically at regular 
intervals and for scheduled time. Optimizes and simplifies the linking 
process and stores the invalid data in a specific area that facilitates 
easy access in the future. Enables the user to obtain value from 
imperfect data and obtain reliable solutions based on the imperfect 
data. Structures such as multiple relational database tables, 
which provide virtually unlimited scalability, are implemented efficiently. 
Performs data loading rapidly and efficiently. Enables handling even 
imperfect data efficiently and supporting multiple servers and data 
sources using the data loader. 

DESCRIPTION OF DRAWING (S) - The figure shows the functional block 
diagram of the data storage and analysis system. 

data loader (105) 

data source (113) 
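
The validation step in this abstract (valid items are loaded, invalid items are moved to a predetermined storage area for later verification) could look roughly like the Python sketch below. The validity rule, the `load` function, and the sample records are hypothetical and only illustrate the routing; the record does not say how validity is decided.

```python
def load(records, is_valid):
    """Route each source record to the loaded set or to a holding area."""
    loaded, holding_area = [], []
    for record in records:
        target = loaded if is_valid(record) else holding_area
        target.append(record)
    return loaded, holding_area


def has_customer_and_amount(record):
    # Hypothetical rule: customer present and amount non-negative.
    return bool(record.get("customer")) and record.get("amount", -1) >= 0


source = [
    {"customer": "ACME", "amount": 120},
    {"customer": "", "amount": 75},        # missing customer
    {"customer": "Globex", "amount": -5},  # negative amount
]
loaded, holding_area = load(source, has_customer_and_amount)
```

Keeping rejected records in a holding area, instead of discarding them, leaves them available for the later verification the abstract mentions.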
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Data transformation method for database management system, involves 
invoking transformation program transforming source data, to generate 
target data using program template parameters, under control of business 
view 
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Abstract (Basic): US 20010049694 Al 

NOVELTY - Program template containing parameters is retrieved, and 
the business view is invoked with the retrieved program template. A 
transformation program that transforms source data, is invoked to 
generate target data, using predetermined parameters of program 
template, based on control of business view . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) Data transformation apparatus; 

(b) Computer readable storage medium storing data transformation 
program 

USE - For transformation of data in relational database 
management system (RDBMS) in several factories. 

ADVANTAGE - Enables user to define reusable templates that enable 
user to provide user-specific parameters to data warehouse program. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram 
illustrating exemplary hardware environment, 
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Star schema search system for relational database in data 

warehouse, acquires storage address of applicable data with reference to 
data coupling table, during data search in database 

Patent Assignee: NEC CORP (NIDE ) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 2001265783 A 20010928 JP 200082427 A 20000323 200202 B 

Priority Applications (No Type Date) : JP 200082427 A 20000323 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2001265783 A 8 G06F-017/30 

Abstract (Basic): JP 2001265783 A 

NOVELTY - A data coupling table is generated corresponding to the 
relation between several tables in a database , according to the 
star schema of the database. During data search in database, the 
storage address of the applicable data is acquired with reference to 
the data coupling table. The searched relevant data is read out from 
the database, based on the storage address. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
recorded medium stored with star schema search program. 

USE - For relational database in data warehouse . 

ADVANTAGE - Data search performance is improved by maintaining data 
coupling between star schema in relational database. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
star schema search system. (Drawing includes non-English language 
text) . 
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Relational object provision system for database management system, 
applies predetermined relationship methods on corresponding objects to 
express their relationship between each base object 
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Abstract (Basic): US 6317749 Bl 

NOVELTY - Several base objects in object oriented data warehouse 
, describe several integration sources including relational 
database table, conversion and reporting application. Several 
relational objects corresponding to base objects, include relationship 
method which is to be applied on corresponding objects to express 
relationship between each other. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) Machine readable medium which stores program for relationship 
object provision; 

(b) Object-oriented data warehouse ; 

(c) Computerized relational object provision method 

USE - For providing relational objects in multi-user system for 
database management system. 

ADVANTAGE - Provides reusability for existing items, and simplifies 
maintenance of the system, and any number of different types of 
integration sources can be supported by the system. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
object-oriented data warehouse . 
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Data mining apparatus for extracting correlation rule within 
relationship database - includes procedure file with series of procedure 
for converting relationship database into item database, after which 
correlation rule between items of database is extracted and 
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Abstract (Basic) : JP 11250084 A 

NOVELTY - Procedure file (312) records procedure for converting 
relational database into item database . A mining executing unit 
extracts correlation rule between items of database and outputs it as 
sequence rule file which is displayed. Procedure file edit unit 
processes attribute value of database, in order. 

DETAILED DESCRIPTION - Procedure edit unit performs procedures such as 
digitization, grouping, displacing non-attribute value, deleting 
attributes, selecting records, itemization, amendments, deletion and 
modification etc. Procedure file application setting unit arranges one 
or more procedure files in order, used by preprocessing executing 
section (302). 

USE - For extracting correlation rule within relationship database. 

ADVANTAGE - Since content of procedure file is changed and applied 
to relational database , several preprocessing praxis can be 
easily repeated. Since hierarchical structure obtained can be displayed 
by effecting conversion based on content of interval data dictionary, 
structure of data obtained by relationship database with application of 
procedure file can be understood with ease. 

DESCRIPTION OF DRAWING (S) - The figure shows detailed block diagram of 
data mining apparatus. 

(302) Preprocessing executing section; (312) Procedure file. 
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